Computer Systems Group Design Issues for Latency Hiding on an Access Decoupled

نویسندگان

  • Graham P. Jones
  • Nigel P. Topham
چکیده

Future software and hardware technologies will try to provide improved performance by extracting higher levels of parallelism. However the cost of a main memory access-in terms of missed instruction slots-increases with faster processors and greater issue widths. For this reason latency hiding technology remains one of the most important parts of high performance processor designs. In this paper we investigate a latency hiding technique known as Access Decoupling which partitions a program into two separate instruction streams in order to aggressively prefetch data. We justify a renewed interest in Access Decoupling in two ways. Firstly as a latency hiding technique and secondly as a solution to the problem of hardware complexity in large issue width, out-of-order superscalar architectures. We show that in comparison to a single instruction stream architecture Access Decoupling is marginally more eeec-tive at hiding memory latency and capable of achieving higher performance through its simpler design. After providing our justiication for renewed interest in the decoupling paradigm we quantify the performance impact of diierent hardware/software design issues on Access Decoupled machines. We consider the eeect of restrictions imposed by data dependency analysis, renaming, memory reordering, operation reordering, issue width and 1 synchronisation points on IPC and latency hiding eeectiveness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simplifying Hardware for Out Of Order Execution using the Decoupling Paradigm

Future hardware and software technology will try to provide improved performance by extracting higher levels of parallelism. However the cost of a main memory access-in terms of missed instruction issue slots-increases with faster processors and greater issue widths. For this reason latency hiding technology remains one of the most important parts of high performance processor designs. In this ...

متن کامل

The limits of a decoupled out-of-order superscalar architecture

This thesis presents a study into a technique for improving performance in outof-order superscalar architectures. It identifies three technological trends limiting superscalar performance; they are the increasing cost of a main memory access, control dependencies and the greater hardware complexity of out-of-order execution. Decoupling is a technique that can provide higher performance through ...

متن کامل

HiDISC: A Decoupled Architecture for Applications in Data Intensive Computing

The ever growing speed gap between processor and main memory has been a major performance bottleneck of modern computer systems. As a result, today’s data intensive applications suffer from frequent cache misses and lose many CPU cycles due to pipeline stalling. Although traditional prefetching methods reduce cache misses considerably, most of them strongly depend on the access pattern being pr...

متن کامل

Improving Latency Tolerance of Multithreading through Decoupling

ÐThe increasing hardware complexity of dynamically scheduled superscalar processors may compromise the scalability of this organization to make an efficient use of future increases in transistor budget. SMT processors, designed over a superscalar core, are therefore directly concerned by this problem. This work presents and evaluates a novel processor microarchitecture which combines two paradi...

متن کامل

ACACES A Decoupled Access/Execute Architecture for Mobile GPUs

Smartphones are emerging as one of the fastest growing markets, providing enhanced capabilities every few months. However, supporting these hardware/software improvements comes at the cost of reducing the operating time per battery charge. The GPU is only left with a shrinking fraction of the power budget, but the trend towards better screens will inevitably lead to a higher demand for improved...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997